MERS exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Whole-proteome phylogeny of prokaryotes by feature frequency profiles: An alignment-free method with optimal feature resolution.

Internal identifier: 002601 (Main/Exploration); previous: 002600; next: 002602

Authors: Se-Ran Jun [United States]; Gregory E. Sims; Guohong A. Wu; Sung-Hou Kim

RBID : pubmed:20018669

Abstract

We present a whole-proteome phylogeny of prokaryotes constructed by comparing feature frequency profiles (FFPs) of whole proteomes. Features are l-mers of amino acids, and each organism is represented by a profile of frequencies of all features. The selection of feature length is critical in the FFP method, and we have developed a procedure for identifying the optimal feature lengths for inferring the phylogeny of prokaryotes, strictly speaking, a proteome phylogeny. Our FFP trees are constructed with whole proteomes of 884 prokaryotes, 16 unicellular eukaryotes, and 2 random sequences. To highlight the branching order of major groups, we present a simplified proteome FFP tree of monophyletic class or phylum with branch support. In our whole-proteome FFP trees (i) Archaea, Bacteria, Eukaryota, and a random sequence outgroup are clearly separated; (ii) Archaea and Bacteria form a sister group when rooted with random sequences; (iii) Planctomycetes, which possesses an intracellular membrane compartment, is placed at the basal position of the Bacteria domain; (iv) almost all groups are monophyletic in prokaryotes at most taxonomic levels, but many differences in the branching order of major groups are observed between our proteome FFP tree and trees built with other methods; and (v) previously "unclassified" genomes may be assigned to the most likely taxa. We describe notable similarities and differences between our FFP trees and those based on other methods in grouping and phylogeny of prokaryotes.
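
The profile construction described in the abstract can be illustrated with a minimal sketch. It counts overlapping l-mers within each protein of a proteome and normalizes the counts to frequencies. The abstract does not say how two profiles are compared; the Jensen-Shannon divergence used below is an assumption made for illustration, and the function names `ffp` and `js_divergence` are hypothetical, not taken from the authors' software.

```python
from collections import Counter
from math import log2


def ffp(proteome, l):
    """Return the l-mer frequency profile of a list of protein sequences.

    Windows never span two proteins: each sequence is scanned separately.
    """
    counts = Counter()
    for seq in proteome:
        for i in range(len(seq) - l + 1):
            counts[seq[i:i + l]] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no l-mers of the requested length")
    return {kmer: n / total for kmer, n in counts.items()}


def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2) between two sparse profiles.

    Assumed distance for this sketch; l-mers absent from a profile
    are treated as having frequency zero.
    """
    div = 0.0
    for kmer in p.keys() | q.keys():
        pi, qi = p.get(kmer, 0.0), q.get(kmer, 0.0)
        mi = 0.5 * (pi + qi)
        if pi > 0:
            div += 0.5 * pi * log2(pi / mi)
        if qi > 0:
            div += 0.5 * qi * log2(qi / mi)
    return div
```

Computing `js_divergence(ffp(a, l), ffp(b, l))` for every pair of proteomes would give a distance matrix that a standard tree-building method could consume. The choice of `l` is the critical parameter the abstract refers to; this sketch does not attempt to reproduce the authors' procedure for selecting it.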

DOI: 10.1073/pnas.0913033107
PubMed: 20018669



The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Whole-proteome phylogeny of prokaryotes by feature frequency profiles: An alignment-free method with optimal feature resolution.</title>
<author>
<name sortKey="Jun, Se Ran" sort="Jun, Se Ran" uniqKey="Jun S" first="Se-Ran" last="Jun">Se-Ran Jun</name>
<affiliation wicri:level="2">
<nlm:affiliation>Department of Chemistry, University of California, Berkeley, CA 94720, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Chemistry, University of California, Berkeley, CA 94720</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Sims, Gregory E" sort="Sims, Gregory E" uniqKey="Sims G" first="Gregory E" last="Sims">Gregory E. Sims</name>
</author>
<author>
<name sortKey="Wu, Guohong A" sort="Wu, Guohong A" uniqKey="Wu G" first="Guohong A" last="Wu">Guohong A. Wu</name>
</author>
<author>
<name sortKey="Kim, Sung Hou" sort="Kim, Sung Hou" uniqKey="Kim S" first="Sung-Hou" last="Kim">Sung-Hou Kim</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PubMed</idno>
<date when="2010">2010</date>
<idno type="RBID">pubmed:20018669</idno>
<idno type="pmid">20018669</idno>
<idno type="doi">10.1073/pnas.0913033107</idno>
<idno type="wicri:Area/PubMed/Corpus">001F80</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">001F80</idno>
<idno type="wicri:Area/PubMed/Curation">001F80</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Curation">001F80</idno>
<idno type="wicri:Area/PubMed/Checkpoint">001E12</idno>
<idno type="wicri:explorRef" wicri:stream="Checkpoint" wicri:step="PubMed">001E12</idno>
<idno type="wicri:Area/Ncbi/Merge">000726</idno>
<idno type="wicri:Area/Ncbi/Curation">000726</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000726</idno>
<idno type="wicri:Area/Main/Merge">002626</idno>
<idno type="wicri:Area/Main/Curation">002601</idno>
<idno type="wicri:Area/Main/Exploration">002601</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Whole-proteome phylogeny of prokaryotes by feature frequency profiles: An alignment-free method with optimal feature resolution.</title>
<author>
<name sortKey="Jun, Se Ran" sort="Jun, Se Ran" uniqKey="Jun S" first="Se-Ran" last="Jun">Se-Ran Jun</name>
<affiliation wicri:level="2">
<nlm:affiliation>Department of Chemistry, University of California, Berkeley, CA 94720, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Chemistry, University of California, Berkeley, CA 94720</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Sims, Gregory E" sort="Sims, Gregory E" uniqKey="Sims G" first="Gregory E" last="Sims">Gregory E. Sims</name>
</author>
<author>
<name sortKey="Wu, Guohong A" sort="Wu, Guohong A" uniqKey="Wu G" first="Guohong A" last="Wu">Guohong A. Wu</name>
</author>
<author>
<name sortKey="Kim, Sung Hou" sort="Kim, Sung Hou" uniqKey="Kim S" first="Sung-Hou" last="Kim">Sung-Hou Kim</name>
</author>
</analytic>
<series>
<title level="j">Proceedings of the National Academy of Sciences of the United States of America</title>
<idno type="eISSN">1091-6490</idno>
<imprint>
<date when="2010" type="published">2010</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Genome</term>
<term>Phylogeny</term>
<term>Prokaryotic Cells (classification)</term>
<term>Prokaryotic Cells (physiology)</term>
<term>Proteome (genetics)</term>
<term>Proteomics (methods)</term>
<term>Sequence Alignment (methods)</term>
<term>Sequence Analysis, Protein (methods)</term>
</keywords>
<keywords scheme="KwdFr" xml:lang="fr">
<term>Alignement de séquences ()</term>
<term>Analyse de séquence de protéine ()</term>
<term>Cellules procaryotes ()</term>
<term>Cellules procaryotes (physiologie)</term>
<term>Génome</term>
<term>Phylogénie</term>
<term>Protéome (génétique)</term>
<term>Protéomique ()</term>
</keywords>
<keywords scheme="MESH" type="chemical" qualifier="genetics" xml:lang="en">
<term>Proteome</term>
</keywords>
<keywords scheme="MESH" qualifier="classification" xml:lang="en">
<term>Prokaryotic Cells</term>
</keywords>
<keywords scheme="MESH" qualifier="génétique" xml:lang="fr">
<term>Protéome</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en">
<term>Proteomics</term>
<term>Sequence Alignment</term>
<term>Sequence Analysis, Protein</term>
</keywords>
<keywords scheme="MESH" qualifier="physiologie" xml:lang="fr">
<term>Cellules procaryotes</term>
</keywords>
<keywords scheme="MESH" qualifier="physiology" xml:lang="en">
<term>Prokaryotic Cells</term>
</keywords>
<keywords scheme="MESH" xml:lang="en">
<term>Genome</term>
<term>Phylogeny</term>
</keywords>
<keywords scheme="MESH" xml:lang="fr">
<term>Alignement de séquences</term>
<term>Analyse de séquence de protéine</term>
<term>Cellules procaryotes</term>
<term>Génome</term>
<term>Phylogénie</term>
<term>Protéomique</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">We present a whole-proteome phylogeny of prokaryotes constructed by comparing feature frequency profiles (FFPs) of whole proteomes. Features are l-mers of amino acids, and each organism is represented by a profile of frequencies of all features. The selection of feature length is critical in the FFP method, and we have developed a procedure for identifying the optimal feature lengths for inferring the phylogeny of prokaryotes, strictly speaking, a proteome phylogeny. Our FFP trees are constructed with whole proteomes of 884 prokaryotes, 16 unicellular eukaryotes, and 2 random sequences. To highlight the branching order of major groups, we present a simplified proteome FFP tree of monophyletic class or phylum with branch support. In our whole-proteome FFP trees (i) Archaea, Bacteria, Eukaryota, and a random sequence outgroup are clearly separated; (ii) Archaea and Bacteria form a sister group when rooted with random sequences; (iii) Planctomycetes, which possesses an intracellular membrane compartment, is placed at the basal position of the Bacteria domain; (iv) almost all groups are monophyletic in prokaryotes at most taxonomic levels, but many differences in the branching order of major groups are observed between our proteome FFP tree and trees built with other methods; and (v) previously "unclassified" genomes may be assigned to the most likely taxa. We describe notable similarities and differences between our FFP trees and those based on other methods in grouping and phylogeny of prokaryotes.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
<region>
<li>Californie</li>
</region>
</list>
<tree>
<noCountry>
<name sortKey="Kim, Sung Hou" sort="Kim, Sung Hou" uniqKey="Kim S" first="Sung-Hou" last="Kim">Sung-Hou Kim</name>
<name sortKey="Sims, Gregory E" sort="Sims, Gregory E" uniqKey="Sims G" first="Gregory E" last="Sims">Gregory E. Sims</name>
<name sortKey="Wu, Guohong A" sort="Wu, Guohong A" uniqKey="Wu G" first="Guohong A" last="Wu">Guohong A. Wu</name>
</noCountry>
<country name="États-Unis">
<region name="Californie">
<name sortKey="Jun, Se Ran" sort="Jun, Se Ran" uniqKey="Jun S" first="Se-Ran" last="Jun">Se-Ran Jun</name>
</region>
</country>
</tree>
</affiliations>
</record>
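
As shown here, the record uses the `wicri:` and `nlm:` prefixes without declaring them, so a strict XML parser rejects it with an unbound-prefix error. The following minimal sketch works around this by injecting placeholder namespace declarations on the root element before parsing. The URIs are hypothetical (the record does not state the real ones), and the helper names `parse_record`, `authors`, and `pmid` are illustrative only.

```python
import xml.etree.ElementTree as ET

# Hypothetical namespace URIs: the record does not declare the real ones.
PLACEHOLDER_NS = 'xmlns:wicri="urn:example:wicri" xmlns:nlm="urn:example:nlm"'


def parse_record(xml_text):
    """Parse a <record> after binding the undeclared prefixes."""
    patched = xml_text.replace("<record>", "<record " + PLACEHOLDER_NS + ">", 1)
    return ET.fromstring(patched)


def authors(record):
    """Return author display names from the titleStmt, in document order."""
    path = "./TEI/teiHeader/fileDesc/titleStmt/author/name"
    return [name.text for name in record.findall(path)]


def pmid(record):
    """Return the PubMed identifier, or None if the record has none."""
    return record.findtext(".//idno[@type='pmid']")
```

Applied to the record above, `authors` would be expected to return the four names listed in `titleStmt`, and `pmid` the value of the `idno` element whose `type` is `pmid`.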

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002601 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 002601 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Sante
   |area=    MersV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     pubmed:20018669
   |texte=   Whole-proteome phylogeny of prokaryotes by feature frequency profiles: An alignment-free method with optimal feature resolution.
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/RBID.i   -Sk "pubmed:20018669" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd   \
       | NlmPubMed2Wicri -a MersV1 

This area was generated with Dilib version V0.6.33.
Data generation: Mon Apr 20 23:26:43 2020. Site generation: Sat Mar 27 09:06:09 2021